Filtering the UMLS ® Metathesaurus ® for MetaMap 2010
نویسنده
چکیده
The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2010AA version of the Metathesaurus, for example, the Metathesaurus includes 5,394,495 English strings, 5,338,590 (98.96%) of them distinct, comprising 2,194,659 concepts. There are 2.20% more English strings and 3.51% more concepts than in the 2009AA edition. Many of the strings in the Metathesaurus are of little value to MetaMap for one of four reasons: 1. Some strings are virtually indistinguishable from each other; for efficiency, only one representative of a set of indistinguishable strings is needed. 2. Some strings either represent general, nonmedical concepts, are unnecessarily ambiguous, or have been found to be problematic for some other reason. 3. Some strings have an assigned type in their vocabulary because they have a form (e.g., an idiosyncratic abbreviation) that is highly unlikely to appear in regular text. 4. Some strings, including lengthy descriptions of things such as procedures, health activities or medical devices, are so complicated that it is again unlikely to find them in normal text.
منابع مشابه
Filtering the UMLS ® Metathesaurus ® for MetaMap 2012 Edition
The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2011AA version of the Metathesaurus...
متن کاملFiltering the UMLS® Metathesaurus® for MetaMap 1999 Edition
MetaMap’s primary purpose is to provide a basis for further processing of biomedical text by finding the Metathesaurus concepts referred to in the text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has su...
متن کاملFiltering the UMLS ® Metathesaurus ® for MetaMap 2009
The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2009AA version of the Metathesaurus...
متن کاملFiltering the UMLS ® Metathesaurus ® for MetaMap 2011 Edition Francois
The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2011AA version of the Metathesaurus...
متن کاملImproving Summarization of Biomedical Documents Using Word Sense Disambiguation
We describe a concept-based summarization system for biomedical documents and show that its performance can be improved using Word Sense Disambiguation. The system represents the documents as graphs formed from concepts and relations from the UMLS. A degree-based clustering algorithm is applied to these graphs to discover different themes or topics within the document. To create the graphs, the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1991